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ABSTRACT : The production of therapeutic proteins from transgenic animals is one of the most important successes of animal 
biotechnology. Milk is presently the most mature system for production of therapeutic proteins from a transgenic animal. Specifically, 
P-casein is a major component of cow, goat and sheep milk, and its promoter has been used to regulate the expression of transgenic 
genes in the mammary gland of transgenic animals. Here, we cloned the porcine P-casein gene and analyzed the transcriptional activity 
of the promoter and intron 1 region of the porcine P-casein gene. Sequence inspection of the 5 '-flanking region revealed potential DNA 
elements including SRY, CdxA, AML-a, GATA-3, GATA-1 and C/EBP p. In addition, the first intron of the porcine P-casein gene 
contained the transcriptional enhancers Oct-1, SRY, YY1, C/EBP P, and AP-1, as well as the retroviral TATA box. We estimated the 
transcriptional activity for the 5 '-proximal region with or without intron 1 of the porcine P-casein gene in HC11 cells stimulated with 
lactogenic hormones. High transcriptional activity was obtained for the 5'-proximal region with intron 1 of the porcine P-casein gene. 
The P-casein gene containing the mutant TATA box (CATAAAA) was also cloned from another individual pig. Promoter activity of the 
luciferase vector containing the mutant TATA box was weaker than the same vector containing the normal TATA box. Taken together, 
these findings suggest that the transcription of porcine P-casein gene is regulated by lactogenic hormone via intron 1 and promoter 
containing a mutant TATA box (CATAAAA) has poor porcine P-casein gene activity. (Key Words : Lactogenic Hormones, HC11 Cells, 
TATA Box, Milk Protein, Porcine Casein Gene) 



INTRODUCTION 

The milk of transgenic animals has the potential for use 
as a bioreactor to product useful recombinant proteins or 
bioactive substances (Clark, 1998; Houdebine, 2009). 
Casein, which is one of the major components of milk 
proteins, includes calcium-sensitive caseins (aSl-, P-, and 
a-S2-casein) and K-casein (Rosen et al., 1999; Rijnkels, 
2001). Casein genes expressed in the mammary gland 
during lactation and are regulated by a variety of factors 
including steroid hormones, peptides, cell-cell and cell- 
extracellular matrix interactions (Doppler et al., 1989; 
Boudreau et al., 1995; Rosen et al., 1999). P-casein is a 
major component of cow, goat and sheep milk and its 
promoter has been used to regulate the expression of 
transgenic genes in the mammary gland of transgenic 
animals (Clark, 1998). P-casein is an abundant milk protein 
expressed in mammary epithelial cells during lactation. In 
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mammary epithelial cell lines, expression of P-casein 
depends on the synergistic action of insulin, glucocorticoid 
and prolactin (Doppler et al., 1989). Since P-casein 
promoter is regulated by the lactogenic hormone during 
lactation, the P-casein promoter has often been used for the 
generation of transgenic animals that produce proteins with 
mammary gland specific expression (Chang et al., 2010). 
The production of therapeutic proteins using transgenic pig 
is one of the most important and significant successes of 
animal biotechnology because pig has some advantage such 
as a short gestation and large litter size (Wheeler and 
Walters, 2001). 

Bovine P-casein cDNA has been identified and found to 
consist of a 672 nucleotide coding region (Baev et al., 
1987). P-casein is the major casein in bovine milk, 
accounting for as much as 30% of its total protein mass. 
The porcine P-casein cDNA sequence consists of a 696-bp 
open reading frame and encodes a protein of 232 amino 
acids in length (Alexander and Beattie, 1992). The bovine 
P-casein gene spans at least 8.5 kb and encodes 9 exons 
(Gorodetsky et al., 1988). Expression of the p-casein milk 
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protein gene has been shown to be regulated by the addition 
of glucocorticoid and prolactin hormones in the murine 
mammary epithelial cell line, HC11 (Schmitt-Ney et al., 
1991). The gene regulatory DNA sequences recognized by 
transcription factors are conserved in the promoters of 
different casein genes from several species (Mercier and 
Viloite, 1993; Rosen et al., 1999). However, transcriptional 
enhancer elements present in the intron increase the basal 
activity of the P-casein promoter. In addition, these intronic 
enhancer elements augment the induction of P-casein 
promoter by lactogenic hormones (Kolb, 2003). Casein 
cDNA and genomic DNA of many species have been 
isolated and characterized by molecular cloning. However, 
porcine P-casein genomic DNA has not yet been isolated or 
characterized. 

In the present study, we describe the exon/intron 
organization of the porcine P-casein gene. We also analyzed 
the transcription unit of the porcine P-casein gene in the 
mouse mammary epithelial cell line, HC11. We 
demonstrated that the first intron of the porcine P-casein 
gene contains a transcriptional promoter/enhancer sequence 
and that lactogenic expression of the porcine P-casein gene 
was probably regulated by the first intron. Moreover, we 
found that porcine P-casein genes promoters containing a 
mutant TATA box (CATAAAA) have poor promoter activity. 

MATERIALS AND METHODS 

Cloning of porcine p-casein gene by long PCR 

Porcine genomic DNA was extracted from the ear 
tissues of three-way crossbreds (LWD) obtained at the 
slaughterhouse. LA-PCR (Long and Accurate PCR) was 
conducted to identify the porcine P-casein genomic DNA as 
follows using primers prepared based on the porcine 
P-casein promoter region (Accession No. AY452035) 
reported to the NCBI and the result of cDNA analysis. The 
primers for PCR were as follows; porcine P-casein 
promoter sense primer (CCCACTATTTCCTGATTCTTGA 
TTAACTTT), exon 6 antisense primer (TGTTGTTCCT 
CCCGCTTTAGCTTCTCAATT), exon 2 sense primer 
(GACTTGATCGCCATGAAGCTCCTCATCCTT) and 
exon 9 antisense primer (GCCTAAGGATTAATTTATTG 
A A ATG ACTGG) . PCR was conducted using 100 ng of the 
genomic DNA extracted from porcine ear tissues with 10 
pmol of sense and antisense primers, 0.5 U i-Max II DNA 
polymerase (Intron, Korea), lxPCR buffer and 200 |iM 
dNTP mixture as follows: 2 min of initial denaturation at 
94°C followed by 30 cycles of 94°C for 30 s, 63°C for 30 
seconds, 72°C for 7 min and 30 s, followed by a final 
elongation step at 72°C for 15 min. The PCR products were 
confirmed by electrophoresis on 0.8% agarose gel, after 
which they were cloned into the pGEM T-easy vector 



(Promega, USA). Sequencing was conducted using an ABI 
PRISM 377 sequencer. The nucleotide sequence was 
analyzed by Genetyx-win (version 4.0). 

Construction of luciferase vector 

To test for promoter activity, the 5 '-flanking region 
containing promoter with and without intron 1 was fused to 
the firefly luciferase gene in pGV-B2 (TOYO INK MFG, 
Japan) as follows. For construction of luciferase vector 
containing only promoter (-2,657 to +39) of porcine P- 
casein gene, PCR was conducted using porcine P-casein 5' 
arm DNA containing normal TATA or mutant TATA box as 
a template. A sense primer (CCCACTATTTCCTGATT 
CTTGATTAACTTT) and antisense primer (CTCGAGTCC 
AAAGTGGAGGACAAGAAGTGA) containing the Xhol 
restriction enzyme site were designed for ligation in the 
multi cloning site of pGV-B2 vector. PCR was conducted 
using 10 pg of DNA and 10 pmol of sense and antisense 
primers, lxPCR buffer, 0.5 unit of i-MaxII polymerase 
(Intron, Korea) and 200 |iM of dNTP mixture. PCR was 
conducted under the following conditions: 2 min of initial 
denaturation at 94 °C, followed by 35 cycles of 94°C for 30 
s, 62°C for 30 s, 72°C for 4 min and a final elongation step 
at 72°C for 15 min. PCR products were ligated with pGEM 
T-easy vector to generate pGEM-P-Promoter plasmid 
(Promega, USA). Inserts were prepared by digestion of 
pGEM-P-Promoter plasmid with EcoRV and Xhol 
restriction enzyme. pGV-B2 vector was prepared by 
digestion of the Smal and Xhol restriction sites, after which 
it was ligated with an insert containing normal TATA or 
mutant TATA box from pGEM-P-Promoter plasmid 
(denoted p/5CElTATA and p^CElmTATA, respectively). 
For construction of luciferase vector containing promoter 
and intron 1 (-2,657 to +2,450) of porcine P-casein, PCR 
was conducted using porcine P-casein 5' arm DNA 
containing normal TATA or mutant TATA box as a template. 
For cloning in pGV-B2 vector, sense primer (CCCACTA 
TTTCCTGATTCTTGATTAACTTT) and antisense primer 
(CCATGGCGATCAAGTCCTGTGAATGGGGAA) 
containing the Ncol restriction enzyme site were designed. 
PCR was conducted with 10 pg of DNA using 10 pmol of 
sense and antisense primers, lxPCR buffer, 0.5 unit of 
i-MaxII polymerase (Intron, Korea) and 200 uM of dNTP 
mixture. PCR was conducted under the following 
conditions: 2 min of initial denaturation at 94°C followed 
by 35 cycles of 94°C for 30 s, 62°C for 30 s, and 72°C for 5 
min, with a final elongation step at 72°C for 15 min. PCR 
products were ligated with pGEM T-easy vector to generate 
pGEM-P-intronl plasmid (Promega, USA). Inserts were 
prepared by digestion of pGEM-P-intronl plasmid with 
Smal and Ncol restriction enzyme. Inserts were subcloned 
into the vector pGV-B2 to generate p/5CE2TATA and 
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p/5CE2mTATAluciferase vector. 

DNA transfection 

For luciferase vector transfection, pCMV P-galactosidase 
plasmid (Clontech, USA) was used for normalization. 
Transfection was conducted as follows. HC11 cells were 
inoculated on a 96 well plate at a density of lxlO 4 cells and 
the culture medium was replaced with fresh medium the 
next day. The cells were then transfected using jetPEI 
transfection reagent (Polyplus-transfection, France) 
according to the manufacturer's instruction. Luciferase 
vector (0.05 pmol) and pCMV-P-gal plasmid (0.02 pmol) 
were used. In the transfection experiment, parallel plates of 
cells were transfected with pGVB2 and pGV-P2, which 
served as negative and positive controls, respectively. The 
plasmid pGV-B2 lacks a eukaryotic promoter; therefore, no 
luciferase activity was detected in cells transfected with 
pGV-B2. The plasmid pGV-P2 contains the SV40 early 
promoter, which drives expression of the luciferase gene. 
The transfection efficiency was assayed by co -transfection 
with the P-galactosidase reporter, pCMV P-gal. 

Lactogeneic hormonal induction of HC11 cells 

After the transfection, cells were cultured for 24 h. The 
culture medium was replaced with serum free medium for 
48 hours with hormone treatment. Hormone treatment was 
conducted using 5 |ig/ml insulin (Sigma, USA), 1 |iM 
dexamethasone (Sigma, USA), and 5 |ig/ml prolactin 
(Sigma, USA) in serum free RPMI 1640 (HyClone, USA) 
for 48 hours. The negative control was not treated with 
hormones. 

Firefly luciferase assays 

After hormone treatment for 48 hours, the cells were 
washed two times with lxPBS and then lysis buffer (50 |il) 
was added into cells. The cell lysates were transferred to 1.5 
ml tubes. After collection, cells were immediately put into 
the liquid nitrogen. Frozen cells were then thawed at 37°C 
and centrifuged at 12,000 rpm and 4°C for 2 min, after 

A) Long-PCR 



which the supernatant was collected. For measurement of 
the luciferase activity, 20 ul supernatant and 50 ul of 
Luciferase Assay System with Reporter Lysis Buffer 
(Promega, USA) were added to a 96 well plate and 
analyzed using a GloMax 96 microplate Luminometer with 
single reagent injector (Promega, USA) according to 
manufacturer's instructions. The transfection efficiency was 
determined by co -transfection with the P-galactosidase 
reporter, pCMV P-gal. P-galactosidase activity was 
measured using a chemiluminescence assay kit (Promega, 
USA) according to the manufacturer's instructions. The 
relative light units were calculated by normalizing the 
firefly luciferase activity against the P-galactosidase activity 
to evaluate the transfection activity. 

RESULTS 

Cloning and molecular characterization of porcine 
P-casein gene 

To identify the P-casein gene in porcine cells, LA-PCR 
was conducted using two pairs of P-casein specific primers 
and porcine genomic DNA as template. PCR products 
containing promoter-exon 6 and exons 2-9 were amplified 
with 6.8 kb and 5.5 kb (data not shown). Figure 1 
summarizes the genomic organization of the porcine 
P-casein gene. Exon 1 encoded the 5'-UTR, while exon 2 
contained the translational initiator codon AUG. Exon 8 
contained the translation termination codon and exon 9 
contained the 3'-UTR. The sequence reported in this paper 
has been deposited in the GenBank database under 
accession no. EU242520. 

All intron/exon splice junctions conformed to the 
Gil AG rule. As shown in Table 1, exon 1 was 45 bp, exon 
2 was 63 bp, exon 3 was 27 bp, exon 4 was 27 bp, exon 5 
was 24 bp, exon 6 was 45 bp, exon 7 was 519 bp, exon 8 
was 42 bp, and exon 9 was 306 bp. With the exception of 
exon 7 and exon 9, all other exons very short. Additionally, 
intron 1 was 2,398 bp, intron 2 was 735 bp, intron 3 was 
118 bp, intron 4 was 1,325 bp, intron 5 was 94 bp, intron 6 
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1 h-H H 

B) Long-PCR 

h-H M ill 

C) p -casein genome (11. 5kb) 

E1 E2 E3E4 E5E6 E7 E8 E9 

Figure 1. Schematic diagram of overlapping PCR products encoding the porcine P-casein gene. Exons are indicated by numbers and 
closed boxes. Arrows indicate PCR primer. 



424 Lee et al. (2012) Asian-Aust. J. Anim. Sci. 25:421-427 



Table 1. Exon/intron organization of the porcine (3-casein gene 

Exon Exon Sequence at exon-intron junction 



No. size (bp) 5' splice donor 3' Splice acceptor 



1 


45 


GGA GAA AAG gtaagaattt 


ccattcacag GAC TTG ATC 


2,398 


2 


63 


GCA AGA GCG gtaagtacag 


ttctctatag AAG GAA GAA 


735 






A R A 


K E E 




3 


27 


TCT GGT GAG gtaagatatt 


tccttttcag ACT GTG GAA 


118 






S G E 


T V E 




4 


27 


AGC AGT GAG gtaagccaat 


ttttctaaag GAA TCT ATT 


1,325 






S S E 


E S I 




5 


24 


ATC AGC AAG gtaaagactt 


tgttttctag GAG AAA ATT 


94 






I S K 


E K I 




6 


45 


CAA AC A GAG gtaatttgtt 


ttctttccag GAT GAA CGC 


1,307 






Q T E 


D E R 




7 


519 


TAC AAC CCT gtaagtccaa 


aatttttaag GTC TAA GAG 


606 






Y N P 


V 




8 


42 


TCA CTT TTG gtaagcttta 


tattccgcag AAT TGA CTG 


784 


9 


305 









was 1,307 bp, intron 7 was 606 bp, and intron 8 was 734 bp. identified P-casein region containing the promoter was 

These results confirmed that the P-casein genome 11,517 bp. Furthermore, we were isolated the P-casein gene 

comprising exon 1 - exon 9 was 8,410 bp in length. The containing the mutant TATA box (CATAAAA) by LA-PCR 

promoter region was 3,107 bp, so the entire length of the of samples from another individual pig (Figure 2A). 



A) 



-> Exonl 



TATA: CjTATAAAAjAT AGCCACAGAA GTGGATGATT ATCCATTCAC CTCCTCCTTC ACTTCTTGTC 



mutant TATA: 



gcataaaa|at 



AGCCACAGAA GTGGATGATT ATCCATTCAC CTCCTCCTTC ACTTCTTGTC 




Relative luciferase activity 





P BCE1mTATA(-2657-+39) 


0.06010.0007 


+ 


pBCE1mTATA(-2657-»39) 


0.112±0.0004 




pBCE1TATA(-2657 -*39) 


0.042±O.O01 


+ 


pBCE1TATA(-2657-*39) 


0.26±0.0051 




pBCE2mTATA(-2657 - +2450) 


1.205±0.0118 


+ 


P BCE2mTATA(-2657 - +2450) 


2.165±0.0152 




PBCE2TATA (-2657 - *2450) 


1.914±0.0203 


+ 


PBCE2TATA (-2657 - *2450) 


4.01U0.065 




pGV-B2 




+ 


pGV-B2 


0.115+0.0008 




PGV-P2 


1.000 



Figure 2. Analysis of the promoter activity. (A) Nucleotide sequence of the porcine (3-casein promoter containing the normal and mutant 
TATA box. (B) Functional promoter activity of the 5 '-flanking region of the porcine P-casein gene fused to the firefly luciferase reporter 
gene. Each chimeric gene was cotransfected with the pCMV P-galactosidase plasmid into HC11 cells, and then assayed for firefly 
luciferase and P-galactosidase activities as described in the material and methods. Firefly luciferase activity in each individual 
experiment was corrected for variations in transfection efficiency by normalizing the value against the P-galactosidase activity in the 
same extract. The normalized activity of each promoter was then expressed relative to that of pGV-P2, with pGV-P2 assigned a relative 
activity of 1.0. The data represent the mean of triple transfection experiments for each plasmid. 
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Functional analysis of the 5'-flanking region of the 
porcine beta-casein gene 

To determine the transcriptional activity of the 5'- 
proximal region and intron 1 of the porcine P -casein gene in 
HC 11 cells by lactogenic hormones, the genomic DNA 
fragments containing the 5 '-flanking region -2,657 to +39 
(P/3CE1TATA) and -2,657 to +2,450 containing intron 1 
(p/5CE2TATA) were fused to the luciferase gene pGV-B2. 
Moreover, the luciferase vectors were constructed from 
genomic DNA containing mutant TATA with the 5 '-flanking 
region -2,657 to +39 (p/3CElmTATA) and -2,657 to +2,450 
containing intron 1 (p/5CE2mTATA) to validate the 
necessity of the normal or mutant TATA box in the promoter 
for expression of the P-casein gene. 

The luciferase activity of the p/5CElTATA and 
p/5CElmTATA without intron 1 were very low when 
compared with the positive control pGV-P2. The luciferase 
activity of p/5CElTATA and p/5CElmTATA were 0.042 and 
0.06, respectively. However, when lactogenic hormone was 
added, the luciferase activity of p/5CElTATA and 
p/5CElmTATA were 0.26 and 0.112, respectively. Moreover, 
the luciferase activity of p/5CElTATA was increased when 
compared with p/5CElmTATA in the hormone treatment or 
non-treatment group. p/3CE2mTATA and p/3CE2TATA 
containing intron 1 of the porcine P-casein gene showed 
remarkably high luciferase activity when compared with 
P/3CE1TATA and p/5CElmTATA without intron 1 in the 
hormone treatment or non treatment group. A high 
luciferase activity (4.011) was obtained from p/5CE2TATA 



subjected to lactogenic hormone treatment (Figure 2B). 

Figure 3 and 4 show the nucleotide sequence of the 5'- 
flanking region and intron 1 of the porcine P-casein gene. A 
computer-assisted search for the putative promoter/ 
enhancer element (set at a cutoff score of >90%) was 
conducted using TFSEARCH (http://mbs.cbrc.jp/research/ 
db/TFSEARCH.html). A typical TATA box was present at 
-32 and potential sites of C/EBPp (CCAAT/Enhancer- 
Binding Protein P) are present at -1,846 and -2,110 (data 
not shown). The 5' flanking region of the porcine P-casein 
gene also contained DNA motifs for SRY, CdxA, AML-a, 
GATA-3, and GATA-1. In addition, the first intron of the 
porcine P-casein gene contained transcriptional enhancer 
Oct-1, SRY, YY1, C/EBP P, AP-1, and the retroviral TATA 
box. 

DISCUSSION 

In this study, the porcine P-casein gene was identified 
by long PCR. This porcine P-casein gene consisted of nine 
exons and eight introns and was 11.57 kbp long, including 
its promoter. Comparison of the organization of the bovine 
and mouse P-casein genes, the porcine P-casein gene had 
the similar exon/intron organization with other species 
(Mercier and Viloite, 1993). In addition, the porcine 
P-casein gene containing the mutant TATA box 
(CATAAAA) was cloned from another individual pig. In 
this P-casein gene, which contained the mutant TATA box, 
the first Thymine was substituted with Cytosine. 



TATGAACATCTATCACTAAAGAGGC 



(SRY) 

CTGGCTCAGACTCTATTATAGCACG TTAGGTGAGA CCCTCCT 1 



GTGGGCAACTAAGAAATACTTTTGAGGC 
TTCTTTCCTCCTTCCATCAT 

TTGGCCTTCATGAATATTAGCTGACATACATTGATTCACTATAGAAGATATGAGACCAAACTTGAGAGGATCGATTT£IX 

(SRY) 

T.TTCTTTTCTTTCTTTCTTCTTCTTCTTCTTCTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTAAGGCTGCCCCCAAAGC 



ACTGGGAAGTTTCCAGGCCAGGGGTCAAATCAGAGTTGCAGCTGTSGTCT 

(AML-la) 



TACCACAGCCATAACAACACCGAACTGAA 
(AML-la) 



ACTGCACCTGTGACCAAGGTCTCTT GTGGGGATGCG GATTCTTAACCCAATGAGCAAGGCCCGGGATTGAACCTGCATCC 
(MZF1) (GATA-1) 

TCAT GGACACCATGTTTTGT TCTTGACCCGCTGAGCCACAAAGGGAAT TCCTAGA AGCACCAC TTTAATAA ATTTGTACC 
(YY1) (STATX) (CdxA) 

AGTATCATTTTTTTCTCTAGAAATATTCTCTAAATGCTACTTTCTGTCTTAAAACCCTTCAAAAAGTCCTC ACTATCTAG 

(GATA-3) 



GCTACTCCTTGCACTTCATAACCTAAATT 1 - A: 'TA I T '-ATT TAGTTGTAGTGACTTTGTTATCATGTTCAAGACCATATT 
(NKX-2) (HNF-3b) 

CTTCCTTTCGTTTTAGCCTACATCTCTCTTTCTTCCTTAGGTCTGAGCTGTCATCACAGGCTTGT CATGTCATTTTC TCC 

(Oot-1) 

GTCTGAGCTGTCATCACAGGCTTGTCATGTCATTTTCTCCATTATGGCATAAAATGGTAACATCTATTTAGTTAG£AlSa 

(Oct-1) 

AATAGTG ACCTTCGTGTGGCCTGTGTATCTCCAATACCTATTAGAATTTCCCCACAAGAAAGCTCTTGAAAACCTACCAA 

(MZF1) 

GTGCTAAAGAGACCTTATTGTGGCTACCATAACTTGGGGACTGGGCCAGAATGTCACTGTCCCCCAGCCAACGTCTGTAC 



TTATTC 



TTCCTGGATGGATTTTCJ 



;aaa: 'A'i : nrr.AAT'-'ATi-v, ;a: : 



(GATA-2) (GATA-1) 



(AML-la) 



(Oct-1) 



(TATA) 



IGGTAAATTATCCACTJCTTAAAATGCTCCTCAGAAT 
(Nkx-2) 

,TTCAAACTCACTATTGATTTTATT 
CATTCACCTCCTC 
Exonl 



(SRY) 

iCCACAGAAGTGGATGATTAT 



1268 
1188 
1108 
1028 
-948 
-868 
-788 
-708 
-628 
-548 
-468 
-388 
-308 
-228 
-148 
-68 
+ 13 



Figure 3. Nucleotide sequence of the 5 '-flanking region of the porcine P-casein gene. The residues preceding it are indicated by a 
negative number. Arrow below the sequence indicates the transcription start site. Potential binding sites for transcription factors (>90% 
cutoff score) are underlined. Sequence information is available under GenBank Accession number EU242520. 
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VGAAAAGGT, 

EXOnl < 1 (GATA-2) (SKY) 

GAACA TAGTTGCTTAAAAAA TGTA TTTAATA TATAGGGTATTTGTGGGTATTT AAGATAGT GCTGATACTATTTTCAGTACTTAGTTTAAAAAAATAGAA 

(C/EBP.) (CdxA) (GATA-2) (GATA-3) l S » Y > 

GTGCCTGTAAAGATT TGATAAAAATT TAAATGACCGTGTTGTATGAATCTAAC AAAATGTAGTAATGGTGA CTGCTATTTCCTTTAGTAAAAGACTAGTT 
(GATA-X) (C/EBPb) ^-T"-===_ 

AACAGGCTGTATTAAAAGATACATTTCTTGAATTATATGTCTCTCAAATTGATTAAACATA CCACAG CC ATAAAGG CAAATACATTTA ATTTATA GCATG 

(AML-1.) (CdxA) (CdxA) 

AGTATATGAAT AATCATTATTAAGAAAGT TTTAGCCCACAAATAGTTTATAGATGTTGCATGTCAGT CAAGAAATGGAGAGA TT TGTTTGGGAACAT GTG 

(Oct-l| (c/EBPb) «") (C/EBP) 

CTCCTTAAAATATTTATAAATAATATTTTTAGAAGAAGAGCATATTTGTCAG AATTGTTTAAAATATCCAAATTCT CAATT' 

(Sox-S) (C/EBP) (Oct-l| 

TTTCAAAACTATTAAAATAGATTCATAAAATACAGAATTAAATTAAAGAC AAACAAAATGTTTACTT TI 



) 

TGI 

(GATA-X) (HFH-1J (Oct-1) 



GTTTATCT GGGAACAAAAAAGAAGGAAAACATGATCTG AAATAAAGAATA GTTTAAAATTCATGGAGTTC' 



ATTGAATrCCTAGCrT AnBAATTTe»ATATG rCAAAAAAAAAAA Ar,Gf:AATAATAAAAA TrCTr,GCTTACAAATTTATAAATAAAATTATAATTATAAAA 
(Oct-1) (Oct-1) 

TAAAGAACATGTAATAATATCTTCCCTACTTGTAAAGAATTCTAATAAAGTACTATATTCTTTTTTTCAGTCTCTACAGTAAAAAA TATTTGGAAATG TT 

(C/EBP) (C/EBPb) 

TGAGAATTTCTGCATAAAGTAAAATCAATAAGGAATAAGACAGCACCCATGAGAAAAATCTAGGGCGAAAAGTCTTAATTTAGTAAAAATCTTGGATTSa 

AGATTGC GTGTCAAGGGATATTGGTGGCACAAA CATTATTTTTT TACAAAACTTTTAGAAATGCAGTAAAATCTACCATTTAGTACATTTTGGTCAAAAG 
(GATA-2) (HFH-2) (B!W-3b) 

TATTAACATATTTTATGCTATGCAGCCAGGAGMGTAGCCTAGAGTCGGGAAGTAGCCTGGAGTG5GGAAGTAGCCTGGAGTCGEGAGGTAGCCTGGAGT 

(KZFl) (MZP1) (MZFl) 

GGGG AGATAGCCTGGAGTGGTAAGCAGCCTAGAGTGG GGATTGGGCAAGT GAACCTGAC CTTTTGGTAAATC ATCAAGAGACAAGTGCACACATGTTCAC 
(MZFl) (GATA-1) C-^EBPb) (C/EBP) 

AGAGATGAAAGAAGGAGAGTGTATAATCTGCAC AATTAT GTGGCACAAAGGAGAAGAGATGCTGTTAGGTATG GGGGATTAAGACTGTGAGGTGGAT ATA 

(CdxA) |Ht-l) (HML-1.) 

TATGGAGAAGAAACAAACCAATCCGCCCAAGAAAA CATCTGGATAAGTG GAGCAAGTAAACG AAATACTA AAATGAAAAGGTCAGTCTTATGGAAATAAA 

(GATA-1) (CdxA) 

AAGCTATGTATAAGGCAATTATGCATATAAATCATTTAGTTGAATAAAGGTAAAATAAAATATGACCTTCTTTGTATGCA TTTTATCTGAT AACTTTCCT 

(GATA-1) 

CTCTGTAGAACTGAGGCTCCAATAGGGCTGAGACCTTACTATTTTGTTCACTGCTAACATTCCAGCACATAAGCAAAGTCTGG CAATAAAAA GTGCTAAA 

(TATA) 

(Oct-1) GCAC ° ° GCTG 0 GC 0 G G 00 CCAATCACTTGTG 

ATGTATGA CTGGGTCACTT TGCTGTAGAGCAGAAATCGACAGAACATTGTAAATCAACTGTACTTCAATTTTAAAAAATTTTAAAAGGTGCC AAATAAAT 
(AP-1) (HFH-2) 

AJ]TJJTTGAATCTATTAATTAAGCTTTCTGTCAATATTTGTAGCTTCTGTATTTCAA^AG_AAAAAACCAACATGTACTGTGAAATGAGATGAAAAAGATTG 

(SRI) 

AAGTA GGATAAGGCT GTTAGTGGAAAAAATCTGAATGGCTGGCAATGAAATAGGAACTTGACATGTAAGATTAAGGAGTCATAGTAGATGTTGACCCTGG 
(GATA-X) 

TCATATGACAAAATTTCTATCAGACAT TTTTATTACACC TTTGGTTTCT TTTAGAGGA AATAAAATAAGAAATATATTAGATTGACCAGTCATATACATT 



Figure 4. Nucleotide sequence of intron 1 of the porcine (3-casein gene. Potential binding sites for transcription factors (>90% cutoff 
score) are underlined. Sequence information is available under GenBank Accession number EU242520. 



Caseins are major milk proteins that account for nearly 
80% of the proteins in milk. Milk contains several caseins, 
asl-, as2-, fi- and K-casein, which are encoded by 
independent genes (Doppler et al., 1989; Rosen et al., 1999). 
Transcription of the casein gene in mammary epithelial 
cells is regulated by the lactogenic hormones, insulin, 
glucocorticoids, and prolactin. The synergistic action of 
glucocorticoid and prolactin hormones on the rat and pig 
(3-casein promoter has been examined in the murine 
mammary epithelial cell line, HC11 (Doppler et al., 1989; 
Boudreau et al., 1995; Rosen et al., 1999; Lee et al. 2008). 

Functional analysis of the 5'-flanking region of the 
porcine P-casein gene was conducted in HC11 cells using 
the genomic DNA fragment containing the 5 '-flanking 
region -2,657 to +39 and -2,657 to +2,450, which includes 
intron 1 in the P-casein gene containing mutant TATA and 
normal TATA. In the p-casein gene containing the normal 
TATA box, the luciferase activity of the 5 '-flanking region 
containing intron 1 (-2,657 to +2,450) by lactogenic 
hormone treatment was 15 fold higher than that of the 5'- 
flanking region not containing intron 1 (-2,657 to +39). Our 
results are consistent with previous data, which suggests 



that expression of the P-casein gene is regulated by 
lactogenic signals via intron 1 (Kolb, 2003; Lee et al., 2008). 
Moreover, the luciferase activity of the 5 '-flanking region 
containing the mutant TATA box in the promoter region was 
lower than that of the 5 '-flanking region containing the 
normal TATA box. It was previously reported that the TATA 
box in the vasopressin and P-globin gene have a deviant 
CATAAAA box (Xu et al., 1991; Ho and Murphy, 2002). 
These mutant TATA boxes have also been shown to elicit 
relatively weak promoter activity (Xu et al., 1991; Ho and 
Murphy, 2002). The results of the present study indicate 
that the TATA box in the promoter region plays an 
important role in expression of the P-casein gene. 

The 5'flanking region of the porcine P-casein gene also 
contained DNA motifs for SRY, CdxA, AML-a, GATA-3, 
GATA-1, and C/EBPa. However, the GR, AP-1, STAT5 and 
YY-1 DNA motifs were not found in this region of the 
porcine P-casein gene. The first intron of the porcine 
P-casein gene contained the putative Oct-1, SRY, YY1, 
C/EBPP, AP-1, and retroviral TATA box. 

The glucocorticoid receptor (GR), AP-1 (fos/jun dimer), 
STAT5 and YY-1 (Ying Yang-1) have been implicated in 
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P-casein gene expression in mice and humans (Mercier and 
Viloite, 1993; Rosen et al., 1999). However, these DNA 
motifs were not found in the 5 '-flanking region of the 
porcine P-casein gene. It has previously been reported that 
the first intron of the mouse and bovine P-casein gene 
contains enhancer elements (Kang et al., 1998; Kolb, 2003). 
The first intron of the mouse P-casein gene contains 
putative GR, YY1, C/EBP, Oct-1, AP-1 and TATA box 
(Kolb, 2003). In this study, the first intron of the porcine 
P-casein gene contained the transcriptional enhancers Oct-1, 
SRY, YY1, C/EBPP, and AP-1, and the retroviral TATA box. 
However, the DNA motif of the glucocorticoid receptor was 
not found in the first intron of the P-casein gene. These 
potential sites may be required for basal transcription and 
regulation of the porcine P-casein gene by the lactogenic 
hormone. 

Here, we demonstrated that the first intron of the 
porcine P-casein gene contains a transcriptional promoter/ 
enhancer sequence and that lactogenic expression of the 
porcine P-casein gene was probably regulated by the first 
intron. However, the mutant TATA box (CATAAAA) in the 
porcine P-casein gene has weak promoter activity. Further 
studies are required to elucidate the actual nature and 
regulation of the porcine P-casein gene. Thus, these data 
can serve as a foundation for mammary gland-specific 
expression of foreign genes in gene targeting pigs. 
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